Skip to main content

More right-solid
More right-solid
Show sorted alphabetically
More right-solid
Show sorted alphabetically
More right-solid
More right-solid
SHOW DETAILS
eye
Title
Date Reviewed
Review
С канала "Все работы хороши" - https://www.youtube.com/channel/UCkbSaWqttPHTS00K0fjniTQ +++++++++++++++++++++++++++++++++ Братюня! В этом выпуске AsSa расскажет все о работе в магазине магнит. Ты узнаешь все о компании Тандер. Сколько зарабатывают в продавцы в магазине Магнит. Как тебя обманывают с ценниками. Все...
Topics: video, youtube, blog, job, shop, magnit

Community Video
movies
eye 167
favorite 1
comment 0

 but Whole
Topic: but Whole
Source: torrent:urn:sha1:f9eb28e2ce7eaf24c918bf405bf6896afac3da69

Хороший учебник для филологических факультетов
Topic: Latin language

CISCO Umbrella Top 1 Million Domains
collection
153
ITEMS
20.2M
VIEWS
-
collection
eye 20.2M

web_domain_tests
web_domain_tests
collection
8,482
ITEMS
36.1M
VIEWS
-
collection
eye 36.1M

WARCs from internal crawl testing.
Topics: web, cctld

This is a booklet for a charity auction done during the Roger Waters Live At Berlin 1990 event. It contains various types of rare art from Pink Floyd and Gerald Scarfe Scanned at 600dpi
Topics: Pink Floyd, The Wall, Berlin, 1990, Auction, Book, South Kensington, Animation, Art, 10:30 AM,...

Brewster Kahle
Brewster Kahle
collection
73
ITEMS
78,929
VIEWS
-
collection
eye 78,929

Books donated by Brewster Kahle
Topic: kahle

OpenCitations
OpenCitations
collection
677
ITEMS
6.5M
VIEWS
-
collection
eye 6.5M

The main output of the  Open Citations  Project is the creation of the  Open Citations Corpus (OCC) , an open repository of scholarly citation data made available under a  Creative Commons public domain  dedication, which provides in RDF accurate citation information (bibliographic references) harvested from the scholarly literature. These are described using the  SPAR Ontologies  according to the  OCC metadata model , and...
Topic: scholarly citation

Community Texts
texts
eye 230
favorite 1
comment 0

Archive of a thread on http://boards.4chan.org/g/thread/51234439/ptg-private-tracker-general
Topic: 4chan

Community Texts
web
eye 867
favorite 1
comment 0

Uploads from the Artzie Music YouTube channel as available on 2020-10-23 saved to WARC.

Хотите восстать из мёртвых? Отключите и телевизор, и Интернет! Хотя бы на месяц. И читайте Л.Толстого: http://rus.earthlyfireflies.org/2017/12/27/tolstoy-books-list.
Topics: earthlyfireflies.org, earthlyfireflies, earthly fireflies, реклама, телевидение,...

Community Software
software
eye 103
favorite 1
comment 0

Virtual Machine for the Archiveteam Warrior http://archiveteam.org/index.php?title=Warrior
Topics: archiveteam, ova

perma_cc
web
eye 6
favorite 1
comment 0

Perma.cc archive of https://www.ftc.gov/sites/default/files/documents/cases/1999/02/9823015cmp.htm created on 2019-09-22 20:21:50+00:00.

Ourmedia
image
eye 3.8M
favorite 1
comment 0

Logos for Internet Archive projects
Topic: logos

Community Video
movies
eye 4
favorite 1
comment 0

Getting to Know Jesus Week 13
Topics: Canton Wesleyan Church, Pastor David Vos

Community Images
image
eye 365
favorite 1
comment 0

4chon's 2013 IRC Logs, leaked by Hsargz in the aftermath of the collapse of 4chon.
Topics: 4chon, IRC

Community Images
image
eye 18
favorite 1
comment 0

allo yoba eto ti?
Topic: meme

No generated pdf
collection
117
ITEMS
589
VIEWS
-
collection
eye 589

Community Video
movies
eye 32
favorite 1
comment 0

ANOIR youtube channel, archived because of (false?) copyright strike on one removed video
Topics: ANOIR, youtube, tarball

Internet Archive crawldata from National Library of Australia 2020 test domain crawl, captured by wbgrp-svc283.us.archive.org:NLA-AU-CRAWL-TEST-2020 from Wed Feb 26 01:07:56 PST 2020 to Tue Feb 25 18:34:04 PST 2020.
Topic: crawldata

Web Collections
Web Collections
collection
13
ITEMS
13,820
VIEWS
-
collection
eye 13,820

Web Collections organized by year. Some of this data is currently not publicly accessible.
Topic: crawl

Community Texts
data
eye 2
favorite 1
comment 0

warc created using grab-site
Topics: warc, archiveteam

Hotel Luxembourg (prima Albergo Marconi) e Hotel Palace, Piazzale della Libertà, Senigallia, cartolina viaggiata nel 1968.
Topics: Senigallia, Regione Marche, Italy, Spiaggia di Senigallia, Italia, Rotonda a Mare, Hotel...

Internet Archive crawldata from National Library of Australia 2020 test domain crawl, captured by wbgrp-crawl008.us.archive.org:NLA-AU-CRAWL-TEST-2020 from Sat Feb 29 15:40:52 PST 2020 to Sat Feb 29 09:07:49 PST 2020.
Topic: crawldata

Community Video
Dec 1, 2020
movies
eye 94
favorite 1
comment 1

This is an old funny video made in Windows Movie Maker or another video editor of 2000's. The author and exact date are still unknown. Though the year of video creating can be assumed as 2007-2008 due to a joke about Mikhail Saakashvili, pop songs of 2000s in video and memories of people who had this video on their mobile phones in their childhood (see comments in https://www.youtube.com/watch?v=ewzS2Xe2QFI ) The video is just a funny compilation of various clips from toons and movies which...
Topics: old video, mobile phone, cell phone, low resolution, digital media, digital video, windows movie...

М.: Художественная литература
Topics: A300, СВЛ

WARCZone: Outsider WARCs
data
eye 16
favorite 1
comment 0

WARCZone: Outsider WARCs
data
eye 1,082
favorite 1
comment 0

In 2013, Vyrd discovered that some anon (whose native language is Finnish) had this curious, undocumented public archive of very, very early handarchived threads from 4chan (along with many other period-appropriate .swfs, videos, and images), spanning all the way from 2004-2008 . Apparently this may have simply been a personal home server that made it's files publicly accessible. This repository is known as the Penfifteen Archive , and is the most important discovery of early 4chan threads to...
Topics: 4chan, penfifteen, vyrd, studionyami, Bibliotheca Anonoma

Books group test collection
texts
eye 178
favorite 1
comment 0

North Korean english newspaper brought back by a visitor and scanned at the Internet Archive.
Source: folio

Учебник Истории 2007г.
Topic: История России

TikTok
TikTok
collection
1,372
ITEMS
53,075
VIEWS
-
collection
eye 53,075

Community Video
movies
eye 13
favorite 1
comment 0

https://www.youtube.com/channel/UC6eVhchHpOLDNgtWAVZImIA
Topics: l.v.g tv, youtube poop, lvg, l.v.g, youtube, ytp, parody

Community Audio
audio
eye 722
favorite 1
comment 0

Atom Heart Mother
Topics: Pink, floyd

Обзор шоу Давай поженимся от Научи хорошему.
Topics: научи хорошему, whatisgood, обзор, видеообзор, пропаганда,...

Community Data
data
eye 9
favorite 1
comment 0

Backup of Microsoft Support articles and images using API. All languages, and de-duped.
Topics: microsoft, support

Community Video
movies
eye 36
favorite 1
comment 0

Первый советский художественный звуковой фильм. История о перевоспитании подростков в Болшевской трудовой коммуне. Педагогическая поэма. Иносказание социалистической революции. Миф с жертвоприношением.
Topics: звуковой фильм, СССР, история

Survey Crawl of .org Sites
collection
1,167
ITEMS
54.6M
VIEWS
-
collection
eye 54.6M

WikiTeam
texts
eye 2
favorite 1
comment 0

Dump (forzoso) del wiki denominado WikicharliE. El sitio chileno se autodefine como " Enciclopedia Virtual de Chile" . "Somos un proyecto innovador chileno, creado como un sistema de información histórico general, donde damos vida a un hermoso proyecto, haciendo relevancia, a que su contenido tenga relación con la historia, cultura y el pueblo de Chile". El sitio tiene muchas carencias, y es extremadamente ideologizado (de derecha). Además, indica que es actualizado por...
Topics: wikicharlie, Chile, enciclopedia, encyclopedia, MediaWiki, wiki, wikiteam

Community Audio
audio
eye 933
favorite 2
comment 0

Pink Floyd Album released 1975
Topics: Pink Floyd - 01 - Shine On You Crazy Diamond (Part One).mp3, Pink Floyd - 02 - Welcome To The...

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl850.us.archive.org:common_crawl from Fri Aug 7 17:28:51 PDT 2020 to Thu Sep 17 09:54:19 PDT 2020.
Topic: crawldata

Archive Team: Chromebot Collection
web
eye 1,254
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Archive Team: Chromebot Collection
web
eye 1,119
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Archive Team: Chromebot Collection
web
eye 3,817
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Document Cloud
texts
eye 4
favorite 1
comment 0

Source: http://www.documentcloud.org/documents/5770999-Додаток5.html

web_locrl
data
eye 0
favorite 1
comment 0

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl851.us.archive.org:common_crawl from Mon Sep 21 17:50:57 PDT 2020 to Mon Oct 12 07:45:36 PDT 2020.
Topic: crawldata

The Yotsuba Society Archives, which held publicly viewable 4chan thread archives, handmade by Jkid. Includes the Moot Video Archive as a bonus. This was obtained from RebeccaBlackTech from an unknown source, possibly direct web scrape.
Topics: Bibliotheca Anonoma, 4chan, Yotsuba Society

Новая пора года, новый сезон, новый формат. Саша покинул проект, но обещал вернуться Evernote и Moleskin: бумажная электроника Новость недели: Samsung проиграл суд Apple NASA: дешевые спутники на Android Новости РБ: наш спутник прислал первые фото. Но мы их вам не покажем Yahoo dream-team! На этой...

Community Data
texts
eye 388
favorite 2
comment 0

Complete XML backup as of 2018-11-04
Topics: wiki, xml, encyclopedia dramatica

Dnipro Regional Scientific Library. Ukraine
Dnipro Regional Scientific Library. Ukraine
collection
95
ITEMS
1,706
VIEWS
-
collection
eye 1,706

Dnipro Regional Scientific Library, one of the oldest Ukrainian libraries, which was founded on May 9 (22), 1834.
Topics: Yekaterinoslav, history, Ukraine, music, periodicals

GitHub Archive Program
collection
5,784
ITEMS
3.6M
VIEWS
-
collection
eye 3.6M

Archive Team: Chromebot Collection
web
eye 2,565
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

500 Years of Images
collection
519
ITEMS
1,646
VIEWS
-
collection
eye 1,646

Outlinks From Tweets
collection
19,306
ITEMS
62.5M
VIEWS
-
collection
eye 62.5M

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl850.us.archive.org:common_crawl from Sat Aug 15 00:54:53 PDT 2020 to Thu Sep 17 11:25:50 PDT 2020.
Topic: crawldata

Wikipedia page.
Topics: wikipedia, offline, pdf, page, mediawiki, 2020-07-28, eo, Esperanto, eowiki, Nigrapieda lingvo

A last minute grab of the Myst Online forums prior to going read only. Myst Online: Uru Live is an open source massively multiplayer online adventure game developed by Cyan Worlds.
Topic: Myst Online, MystOnline

Around The World Crawl
Around The World Crawl
collection
2,150
ITEMS
329.9M
VIEWS
-
collection
eye 329.9M

Data crawled by Sloan Foundation on behalf of Internet Archive

Top 150 Crawl
Top 150 Crawl
collection
30
ITEMS
5.2M
VIEWS
-
collection
eye 5.2M

Top 150 Alexa sites crawl performed by Internet Archive. This data is currently not publicly accessible.

Archive Team: A Miscellaneous Smattering of Panic
Archive Team: A Miscellaneous Smattering of Panic
collection
20
ITEMS
103,076
VIEWS
-
collection
eye 103,076

-
collection
eye 3.1M

Data collected by Internet Archive on behalf of the Fundacao para a Computacao Cientifica Nacional of Portugal. This data is currently not publicly accessible.

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl851.us.archive.org:common_crawl from Sat Sep 19 14:20:21 PDT 2020 to Mon Oct 12 07:10:22 PDT 2020.
Topic: crawldata

Arkive
Arkive
collection
205
ITEMS
29,073
VIEWS
-
collection
eye 29,073

Backup videos and channels relating to survival, firearms, militaria, and other such topics.
Topic: arkive, youtube

Archive Team: The Viddy Tune Out
Archive Team: The Viddy Tune Out
collection
376
ITEMS
3,333
VIEWS
-
collection
eye 3,333

Remember Viddy? The one-time red hot mobile video app is closing down. That’s according to Fullscreen, the company that bought it for $20 million — a snip of its peak valuation — earlier this year. A post on the Viddy blog — first spotted by App Advice — explains that the Viddy app was removed from the App Store and Google Play on November 4. Anyone who already installed either app has until December 15 to use it, after which the service will officially shut down. Those wishing to...

Cuil Crawl Data
Cuil Crawl Data
collection
0
ITEMS
22M
VIEWS
-
collection
eye 22M

Web crawl snapshot generously donated from cuil.com . This collection of pages mostly from 2007 and some from 2008, is about 310 terabytes of compressed data, and almost 60 billion URLs (mostly text). Cuil was a search engine that organized web pages by content and displayed relatively long entries along with thumbnail pictures for many results. Cuil said it had a larger index than any other search engine, with about 120 billion web pages. It went live on July 28, 2008. Cuil's servers were shut...

Corentin Barreau's Web Archives
Corentin Barreau's Web Archives
collection
1,251
ITEMS
2.6M
VIEWS
-
collection
eye 2.6M

Various web content archived by Corentin Barreau .

Archive Team: Preposterous! The Posterous Grab
Archive Team: Preposterous! The Posterous Grab
collection
447
ITEMS
1.6M
VIEWS
-
collection
eye 1.6M

Posterous will turn off on April 30 : Posterous launched in 2008. Our mission was to make it easier to share photos and connect with your social networks. Since joining Twitter almost one year ago, we’ve been able to continue that journey, building features to help you discover and share what’s happening in the world – on an even larger scale. On April 30th, we will turn off posterous.com and our mobile apps in order to focus 100% of our efforts on Twitter. This means that as of April 30,...

web-group-internal
collection
32,997
ITEMS
1.1B
VIEWS
-
collection
eye 1.1B

miscellaneous data
Topic: brad tofel

Uzerus Crawls
collection
1,638
ITEMS
946,345
VIEWS
-
collection
eye 946,345

Uzerus web crawls
Topics: Uzerus web crawls, archiveteam

Журнал представляет собой  периодическое издание Органа Екатеринославского Общества Пчеловодства. На его страницах представлены: протоколы заседаний товарищества, различные методы ведения пчеловодного хозяйства; информация о подсолнечнике, как о медоносном растении;...
Topics: Екатеринослав, периодика, журнал, пчеловодство

[Франкфурт-на-Майне]: Посев, 1946. — 79 с. Научное и философское значение ряда коперниканских переворотов, происшедших в недрах современной физики, еще далеко не вполне осознано, тем более, что революция в физике еще не закончена. Тем не менее, если рядовой западный любитель...
Topics: материализм, materialism, физика, physics, теория...

University of Michigan Books
texts
eye 234
favorite 1
comment 0

Bibliographical footnotes

Archive Team: Chromebot Collection
web
eye 648
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Archive Team: Chromebot Collection
web
eye 7,887
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Alexa Crawls
web
eye 6,624
favorite 1
comment 0

Alexa crawl
Topic: crawldata

collection
eye 40

Alex Krey - professional illusionist, showman, finalist of show "Phenomenon" on TV channel "Russia", an operating member of the International Brotherhood of Illusionists, prize-winner international illusion festival in Turkey, the creative actor who is going on tour on the world. Алекс Крэй - профессиональный иллюзионист, шоумен, финалист шоу «Феномен» на телеканале «Россия»,...
Topics: podcasts, Alex Krey illusionist - Алекс Крэй иллюзионист

Catalogs Collection
texts
eye 12
favorite 1
comment 0

COL-875 catalog
Topic: classical

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl851.us.archive.org:common_crawl from Sat Sep 26 00:48:05 PDT 2020 to Mon Oct 12 09:14:27 PDT 2020.
Topic: crawldata

Wayback Robots Crawl
Wayback Robots Crawl
collection
129
ITEMS
5.9M
VIEWS
-
collection
eye 5.9M

Wayback robots.txt crawl performed by Internet Archive. This data is currently not publicly accessible.

Wikipedia Articles (PDF Versions)
texts
eye 6
favorite 1
comment 0

Wikipedia page.
Topics: wikipedia, offline, pdf, page, mediawiki, 2020-07-28, ru, Russian, ruwiki, Asaphida

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl850.us.archive.org:common_crawl from Fri Aug 14 10:06:02 PDT 2020 to Thu Sep 17 11:17:37 PDT 2020.
Topic: crawldata

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl851.us.archive.org:common_crawl from Mon Sep 21 17:50:57 PDT 2020 to Mon Oct 12 07:45:43 PDT 2020.
Topic: crawldata

Community Texts
Mar 1, 2020
texts
eye 188
favorite 2
comment 1

Complete XML backup as of 2019-03-09
Topics: wiki, xml, encyclopedia dramatica

Academic Torrents
data
eye 411
favorite 1
comment 0

This is a collection of Geocities data downloaded by a bunch of people who call themselves ARCHIVE TEAM, who began scraping the Yahoo! Geocities site during a six month period in 2009, before Yahoo! shut down geocities.com on October 26th, 2009. This collection is compressed in a UNIX filesystem with both 7zip archives and tape archives (gtar). If you're a bit of a data tourist and just want to waft in the scent of a web era gone by, please go to one of the Geocities mirrors that were put up in...
Source: http://academictorrents.com/details/2dc18f47afee0307e138dab3015ee7e5154766f6

Whitepaper for the Library Leaders Forum held at the Internet Archive in October 2016. Transforming Our Libraries into Digital Libraries: A digital book for every physical book in our libraries Brewster Kahle, Internet Archive Library Leaders Forum Discussion Document, October 2016 Today, people get their information online—often filtered through for-profit platforms. If a book isn’t online, it’s as if it doesn’t exist. Yet much of modern knowledge still exists only on the printed page,...
Topics: Brewster Kahle, Digital Libraries

Community Texts
texts
eye 19,532
favorite 2
comment 0

Kievlyanin gazette, year 1917.
Topic: Kiev

Alexa Crawls
web
eye 3,581
favorite 1
comment 0

Alexa crawl
Topic: crawldata

Fix Broken Links Web Crawls
Fix Broken Links Web Crawls
collection
120,252
ITEMS
2.6B
VIEWS
-
collection
eye 2.6B

These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved. Then the Internet Archive hopes that references to these archived pages will be put in place of a link that would be otherwise be broken, or a companion link to allow people to see what was originally intended by a page's...

Community Data
data
eye 534
favorite 2
comment 0

An archive of the r/gamersriseup subreddit
Topic: Reddit

Data crawled by Common Crawl on behalf of Common Crawl, captured by crawl850.us.archive.org:common_crawl from Tue Aug 11 05:54:49 PDT 2020 to Thu Sep 17 10:36:00 PDT 2020.
Topic: crawldata

Alexa Crawls
web
eye 0
favorite 1
comment 0

Alexa crawl
Topic: crawldata

Internet Archive crawldata from feed-driven Twitter Outlinks Crawl, captured by crawl863.us.archive.org:twitter from Sat Oct 17 13:15:32 PDT 2020 to Sat Oct 17 09:51:29 PDT 2020.
Topics: twitter, crawldata

Archive Team: Chromebot Collection
web
eye 2,109
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Archive Team: Chromebot Collection
web
eye 718
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Archive Team: Chromebot Collection
web
eye 1,283
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Archive Team: Chromebot Collection
web
eye 898
favorite 1
comment 0

Topics: archiveteam, crocoite, chromebot

Archive Team: PLAYS.TV Replays
Archive Team: PLAYS.TV Replays
collection
6,578
ITEMS
1M
VIEWS
-
collection
eye 1M

An Archive Team group grab of the game streaming site PLAYS.TV.  "Plays.tv is one of the best ways to record, review, and share your gameplays. The app has been known to be  In fact, it is one of the most popular recording softwares used for League of Legends. Unfortunately, everything good must come to an end. The developers of Plays.tv announced that the site is closing down. They will also discontinue support for the website and desktop application on December 15."

Biblioteca Nazionale Centrale di Firenze
Biblioteca Nazionale Centrale di Firenze
collection
224
ITEMS
13M
VIEWS
-
collection
eye 13M

Data collected by Internet Archive on behalf of Biblioteca Nazionale Centrale di Firenze. This data is currently not publicly accessible.